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Abstract — Low-density parity-check (LDPC) convolutional co- 
des are capable of achieving excellent performance with low 
encoding and decoding complexity. In this paper we discuss 
several graph-cover-based methods for deriving families of time- 
invariant and time-varying LDPC convolutional codes from 
LDPC block codes and show how earlier proposed LDPC 
convolutional code constructions can be presented within this 
framework. 

Some of the constructed convolutional codes significantly 
outperform the underlying LDPC block codes. We investigate 
some possible reasons for this "convolutional gain," and we also 
discuss the — mostly moderate — decoder cost increase that 
is incurred by going from LDPC block to LDPC convolutional 
codes. 

Index Terms — Block codes, convolutional codes, low-density 
parity-check (LDPC) codes, message-passing iterative decoding, 
pseudo-codewords, pseudo-weights, quasi-cyclic codes, unwrap- 
ping, wrapping. 



I. Introduction 

IN THE LAST fifteen years, the area of channel coding 
has been revolutionized by the practical realization of 
capacity-approaching coding schemes, initiated by the inven- 
tion of turbo codes and their associated decoding algorithms 
in 1993 A few years after the invention of the turbo 
coding schemes, researchers became aware that Gallager's 
low-density parity-check (LDPC) block codes and message- 
passing iterative decoding, first introduced in 121, were also 
capable of capacity-approaching performance. The analysis 
and design of these coding schemes quickly attracted con- 
siderable attention in the literature, beginning with the work 
of Wiberg IS), MacKay and Neal pl|, and many others. An 
irregular version of LDPC codes was first introduced by Luby 
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et at. in IS), 161, and analytical tools were presented in Q, 
[8] to obtain performance limits for graph-based message- 
passing iterative decoding algorithms, such as those suggested 
by Tanner For many classes of channels, these tools have 
been successfully employed to design families of irregular 
LDPC codes that perform very well near capacity ifTOl . ifTTIl . 
Moreover, for the binary erasure channel these tools have 
enabled the design of families of irregular LDPC codes that are 
not only capacity-approaching but in fact capacity-achieving 
(see I.12J and references therein). 

The convolutional counterparts of LDPC block codes are 
LDPC convolutional codes. Analogous to LDPC block codes, 
LDPC convolutional codes are defined by sparse parity-check 
matrices, which allow them to be decoded using iterative 
message-passing algorithms. Recent studies have shown that 
LDPC convolutional codes are suitable for practical imple- 
mentation in a number of different communication scenarios, 
including continuous transmission and block transmission in 
frames of arbitrary size ifTSl - lfTSl . 

Two major methods have been proposed in the Uterature for 
the construction of LDPC convolutional codes, two methods 
that in fact started the field of LDPC convolutional codes. 
The first method was proposed by Tanner ifTSl (see also 
ifTTll . ifTSll ) and exploits similarities between quasi-cyclic block 
codes and time-invariant convolutional codes. The second 
method was presented by Jimenez-Feltstrom and Zigangirov 
in lfT9l and relies on a matrix-based unwrapping procedure to 
obtain the parity-check matrix of a periodically time-varying 
convolutional code from the parity-check matrix of a block 
code. 

The aims of this paper are threefold. First, we show that 
these two LDPC convolutional code construction methods, 
once suitably generalized, are in fact tightly connected. We 
establish this connection with the help of so-called graph 
covers0 A second aim is to discuss a variety of LDPC convolu- 
tional code constructions. Although the underlying principles 
are mathematically quite straightforward, it is important to 
understand how they can be applied to obtain convolutional 



' Note that graph covers have been used in two different ways in the LDPC 
code literature. On the one hand, starting with the work of Tanner 1201 , they 
have been used to construct Tanner graphs (9l of LDPC codes, and therefore 
parity-check matrices of LDPC codes. Codes constructed in this way are 
nowadays often called proto-graph-based codes, following the influential work 
of Thorpe t21|, who formalized this code construction approach. On the other 
hand, starting with the work of Koetter and Vontobel 1221 , 1231 , finite graph 
covers have been used to analyze the behavior of LDPC codes under message- 
passing iterative decoding. In this paper, we will use graph covers in the first 
way, with the exception of some comments on pseudo-codewords. 
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codes with good performance and attractive encoder and 
decoder architectures. A third aim is to make progress towards 
a better understanding of where the observed "convolutional 
gain" comes from, and what its costs are in terms of decoder 
complexity. 

The paper is structured as follows. After some notational 
remarks in Section II-AI we discuss the basics of LDPC 
convolutional codes in Section In particular, in that section 
we give a first exposition of the LDPC convolutional code 
construction methods due to Tanner and due to Jimenez- 
Feltstrom and Zigangirov. In Section |lll] we discuss two types 
of graph-cover code constructions and show how they can be 
used to connect the code construction methods due to Tanner 
and due to Jimenez-Feltstrom and Zigangirov. Based on these 
insights. Section HVl presents a variety of LDPC convolutional 
code constructions (along with simulation results), and in Sec- 
tion |V] we mention some similarities and differences of these 
constructions compared to other recent code constructions in 
the literature. Afterwards, in Section |VT] we analyze some 
aspects of the constructed LDPC convolutional codes and 
discuss some possible reasons for the "convolutional gain," 
before we conclude the paper in Section IVIII 



A. Notation 

We use the following sets, rings, and fields; Z is the ring 
of integers; Z^o is the set of non-negative integers; F2 is 
the field of size two; F2[X] is the ring of polynomials with 
coefficients in F2 and indeterminate X; and ¥2[X]/ (X'^—l) 
is the ring of polynomials in F2[X] modulo X'' — 1, where 
r is a positive integer. We also use the notational short-hand 
F^''^[X] for ¥2[X]/(X''-l). 

By F5 and F^''^[X]", we mean, respectively, a row vector 
over F2 of length n and a row vector over V^'' [X] of length 
n. In the following, if M is some matrix, then [M]j,i denotes 
the entry in the j-th row and i-th column of M. Note that we 
use the convention that indices of vector entries start at (and 
not at 1), with a similar convention for row and column indices 
of matrix entries. (This comment applies also to semi-infinite 
matrices, which are defined such that the row and column 
index sets equal Z^o-) The only exception to this convention 
are bi-infinite matrices, where the row and column index sets 
equal Z. Finally, Mi (E> M2 will denote the Kronecker product 
of the matrices Mi and 7^2- 



II. LDPC Convolutional Codes 

This section defines LDPC convolutional codes and dis- 
cusses why they are interesting from an implementation 
perspective. Afterwards, we review two popular methods of 
obtaining LDPC convolutional codes by unwrapping block 
codes. Later in this paper, namely in Section |III] we will use 
graph covers to show how these two methods are connected, 
and in Section |IV] we will see how these two methods can 
be implemented and combined to obtain LDPC convolutional 
codes with very good performance. 

A. Definition of LDPC Convolutional Codes 

A semi-infinite binary parity-check matrix as in ^ at the 
top of this page defines a convolutional code Cconv as follows. 
Namely, it is the set of semi-infinite sequences given by 



where ( • )^ denotes the transpose of a vector or of a matrix. 

We comment on several important aspects and properties of 
the code Cconv and its parity-check matrix Hcom- 

• If the submatrices Hi{t), i = 0,1,-- - , mg, t e Z^o^ 
have size (c — &) x c with b < c, then Cconv is said to 
have (design) rate R^b/c. 

• The parameter iris that appears in ([T) is called the 
syndrome former memory. It is an important parameter 
of Cconv because the maximal number of non-zero sub- 
matrices per block row of iJconv is upper bounded by 

TTlg + 1. 

• The quantity Vs = (mg + 1) • c is called the constraint 
length of Cconv- It measures the maximal width (in 
symbols) of the non-zero area of .ffconvQ 

• We do not require that for a given i = 0, 1, . . . , mg the 
submatrices {ffi(t)}tgz>o are independent of t, and so 
Cconv is in general a time-varying convolutional code. 

« If there is a positive integer Tg such that Hi{t) = Hi{t + 
Ts) for all i = 0, 1, . . . , mg and all t G Z^o, then Tg is 
called the period of Hconv, and Cconv is periodically 
time-varying. 

• If the period Tg equals 1, then iJconv is called time- 
invariant, and the parity-check matrix can be simply 

-Strictly speaking, tlie above formula for gives only an upper bound 
on the maximal width (in symbols) of the non-zero area of J/conv . but this 
upper bound will be good enough for our purposes. 
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written as 

Ho 

Hi Hq 

HconV ^ TT TT TT 

n-m^ -flma-l • • • -Ho 

Hm^ H„i^-l . . . Hq 



• If the number of ones in each row and column of Hconv 
is small compared to the constraint length i^g, then Cconv 
is an LDPC convolutional code. 

• An LDPC convolutional code Cconv is called (m,, J, K)- 
regular if, starting from the zeroth column, Hconv has J 
ones in each column, and, starting from the (mg + l) • (c — 
6)-th row, Hconv has K ones in each row. If, however, 
there are no integers mg, J, and K such that Cconv is 
(ms, J, A ) -regular, then Cconv is called irregular 

Of course, there is some ambiguity in the above definition. 
Namely, a periodically time-varying LDPC convolutional code 
with parameters Tg, b, and c can also be considered to be a 
periodically time-varying LDPC convolutional code with pa- 
rameters Tg' = Ts/i, b' = £-b,c' = £-c, and R' = b'/c' = b/c 
for any integer £ that divides Tg. In particular, for £ ~ we 
consider the code to be a time-invariant LDPC convolutional 
code with parameters b' = ■ b and c' = Tg • c. 

B. Implementation Aspects of LDPC Convolutional Codes 

An advantage of LDPC convolutional codes compared to 
their block code counterparts is the so-called "fast encoding" 
property. As a result of the diagonal shape of their parity-check 
matrices, many LDPC convolutional codes enjoy simple shift 
register based encoders. Even randomly constructed LDPC 
convolutional codes can be formed in such a way as to achieve 
this feature without any loss in performance (see, e.g., lfT9ll . 
1241 '). On the other hand, in order to have a simple encoding 
procedure for LDPC block codes, either the block code must 
have some sort of structure IZSll or the parity-check matrix 
must be changed to a more easily "encodable" form ll26l . 

The difficulty in constructing and decoding LDPC convolu- 
tional codes is dealing with the unbounded size of the parity- 
check matrix. This is overcome at the code construction step 
by considering only periodically time-varying or time-invariant 
codes. The code construction problem is therefore reduced to 
designing just one period of the parity-check matrix. For de- 
coding, the most obvious approach is to terminate the encoder 
and to employ message-passing iterative decoding based on 
the complete Tanner graph representation of the parity-check 
matrix of the code. Although this would be straightforward 
to implement using a standard LDPC block code decoder, 
it would be wasteful of resources, since the resulting (very 
large) block decoder would not be taking advantage of two 
important aspects of the convolutional structure: namely, that 
decoding can be done continuously without waiting for an 
entire terminated block to be received and that the distance 
between two variable nodes that are connected to the same 
check node is limited by the size of the syndrome former 
memory. 



In order to take advantage of the convolutional nature of the 
parity-check matrix, a continuous sliding window message- 
passing iterative decoder that operates on a window of size 
/ • I's variable nodes, where / is the number of decoding 
iterations to be performed, can be implemented, similar to a 
Viterbi decoder with finite path memory IZTl . This window 
size is chosen since, in a single iteration, messages from 
variable (or check) nodes can be passed across a span of 
only one constraint length. Thus, in / iterations, messages can 
propagate only over a window of size / constraint length. (See 
also the recent paper by Papaleo et al. |[28l|, which investigates 
further reducing the window size for codes operating on 
a binary erasure channel (BEC).) Another simplification is 
achieved by exploiting the fact that a single decoding iteration 
of two variable nodes that are at least mg + 1 time units 
apart can be performed independently, since the corresponding 
bits cannot participate in the same parity-check equation. This 
allows the parallelization of the / iterations by employing / 
independent identical processors working on different regions 
of the parity-check matrix simultaneously, resulting in the 
parallel pipeline decoding architecture introduced in |fT9ll . The 
pipeline decoder outputs a continuous stream of decoded data 
after an initial decoding delay of I ■ received symbols. The 
operation of this decoder on the Tanner graph of a simple 
time-invariant rate- 1/3 convolutional code with rus = 2 and 
j/g = 9 is illustrated in Figure 

Although the pipeline decoder is capable of fully paralleliz- 
ing the iterations by using / independent identical processors, 
employing a large number of hardware processors might not be 
desirable in some applications. In such cases, fewer processors 
(even one processor) can be scheduled to perform subsets of 
iterations, resulting in a serial looping architecture ||29l with 
reduced throughput. This ability to balance the processor load 
and decoding speed dynamically is especially desirable where 
very large LDPC convolutional codes must be decoded with 
limited available on-chip memory. Further discussion on the 
implementation aspects of the pipeline decoder can be found 
in l30l . 

C. Unwrapping Techniques due to Tanner and due to Jimenez- 
Feltstrdm and Zigangirov (JFZ) 

In this subsection we discuss two approaches for deriving 
convolutional codes from block codes, in particular for de- 
riving LDPC convolutional codes from LDPC block codes. 
The first technique will be the unwrapping due to Tanner and 
the second will be the unwrapping due to Jimenez-Feltstrom 
and Zigangirov (JFZ). In Section |lll] we will see, with the 
help of graph covers, how these two - seemingly different - 
unwrapping techniques are connected with each other. 

The term unwrapping, in particular unwrapping a quasi- 
cyclic block code to obtain a time-invariant convolutional 
code, was first introduced in a paper by Tanner 1171 (see 
also lfT6l ). That paper describes a link between quasi-cyclic 

^For LDPC convolutional codes the parameter Ub is usually much larger 
than typical values of i^s for "classical" convolutional codes. Therefore the 
value i/s = 9 of the convolutional code shown in Figure [T] is not typical for 
the codes considered in this paper. 
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Fig. 1. Tanner graph of a rate- 1/3 convolutional code and an illustration of pipeline decoding. Here, yoit), yi{t), y2{t) denote the stream of channel output 
symbols, and vo(t), i>i{t), V2{t) denote the stream of decoder code bit decisions. 



block codes and time-invariant convolutional codes and shows 
that the free distance of the unwrapped convolutional code 
cannot be smaller than the minimum distance of the underlying 
quasi-cyclic code. This idea was later extended in lISTl . 1321 . 
Consider the quasi-cyclic block code Cq^ defined by the 



polynomial parity-check matrix Hq^{X) of size 



m X n. I.e., 



^QC — 



{v{X) 



[XY 



Here the polynomial operations are performed modulo X^' — 1. 
The Tanner unwrapping technique is simply based on dropping 
these modulo computations. More precisely, with a quasi- 
we associate the convolutional code 



-(r) 



He, 



cyclic block code Cq(-, 

Cconv = {v{D) e¥2[D]' 

with polynomial parity-check matrix 

HconviD) = Hq^{X) 



X=D 



(3) 



Here the change of indeterminate from X to D indicates the 
lack of the modulo — 1 operations. (Note that in (O we 
assume that the exponents appearing in the polynomials in 
Hq^{X) are between and ?- — 1 inclusive.) 

It can easily be seen that any codeword v{D) in Cconv maps 
to a codeword v{X) in Cq^ through 



v{X) = v{D) mod {W - i; 



D=X 



a process which was described in iflTl as the wrapping around 
of a codeword in the convolutional code into a codeword in 
the quasi-cyclic code. The inverse process was described as 
unwrapping. 

Having introduced the unwrapping technique due to Tanner, 
we move on to discuss the unwrapping technique due to 
JFZ ||T9l , which is another way to unwrap a block code to 
obtain a convolutional code. The basic idea is best explained 
with the help of an example. 




Fig. 2. Deriving a rate _R = 
code with 6 = 1, c = 2, rris 
block code of length 10. 



1/2 periodically time-varying convolutional 
= 4, i/s = 10, and Ts = 5 from a rate-1/2 



Example 1 Consider the parity-check matrix 
1110 111 



H 











111 



1110 1 



1 1 1 1 



10 11 
110 

1 1 



1 1 1 1 1 1 



with size mxn ~ 5x10, of a rate-l/2 block code. As indicated 
above, we can take a pair of scissors and "cut" the parity- 
check matrix into two pieces, whereby the cutting pattern is 
such that we repeatedly move c = 2 units to the right and 
then c — b=l unit down. Having applied this "diagonal cut," 
we repeatedly copy and paste the two parts to obtain the bi- 
infinite matrix shown in Figure\2\ This new matrix can be seen 
as the parity-check matrix of (in general) a periodically time- 
varying convolutional code (here the period is Tg = 5j. It is 
worth observing that this new matrix has the same row and 
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column weights as the matrix that we started with^ □ 

This example can be formalized easily. Namely, starting 
with an m X rt parity-check matrix H of some block code, let 
rj = gcd(m, n). Then the "diagonal cut" is performed by alter- 
nately moving c = n/rj units to the right and then c—b^m/rj 
units down (i.e., b = [{n — m)/rj). The resulting convolutional 
code has rate R^b/c, syndrome former memory rrig = 7^ — 1, 
constraint length = {nis + I) ■ c ^ rj ■ c = n, and period 
Ts = ms + 1 = 77. 

Analogous to the comment at the end of Section III-AI it 
is also possible to cut the matrix H in larger step sizes, e.g., 
moving c' = £-c units to the right and c' — b' = £■ {c — b) units 
down, for any integer i that divides Tg = 77, thereby obtain- 
ing a periodically time-varying convolutional code with rate 
R' = b' jc! = b/c, syndrome former memory m'^ = {rj/i) — 1, 
constraint length ly'^ ~ {m'^ + 1) ■ c' = rj ■ c = n, and period 
Tg = rn'^ + 1 = 7]/£. (See also the discussion in Section lTV-BI ) 

In the rest of this paper, the term "JFZ unwrapping tech- 
nique" will also stand for the following generalization of the 
above procedure. Namely, starting with a length-7i block code 
C defined by some size-7ri x 77, parity-check matrix H, i.e.. 



H 



or}. 



we write H as the sum H = J2eec (i" ^) °f ^ collection 
of matrices {Hi}i^c- The convolutional code Cconv is then 
defined to be 



Hr 



(4) 



where 



Hr 



Ho 



Ho 



H 



\c\-i 



Hi 
H 



\C\-2 
\C\-1 



Ho 



Hi 



£1-2 



Ho 



Referring to the notation introduced in Section lTl-AI the matrix 
Hconv is the parity-check matrix of a time-invariant convolu- 
tional code. However, depending on the decomposition of H 
and the internal structure of the terms in that decomposition, 
the matrix iJconv can also be (and very often is) viewed as 
the parity-check matrix of a time-varying convolutional code 
with non-trivial period Tg. 

In order to illustrate the generalization of the JFZ unwrap- 
ping technique that we have introduced in the last paragraph, 
observe that decomposing H from Example \l\ as H = 
Ho + Hi (in Z) with 



Ho 



00:00000000 

1 'i" 1': 

1 1 1 o'o 'i'lo 000 

1 1 1 ' " 1 : 
1 1 1 1 1 0" 1": 



*In practice, the codewords start at some time, so tlie convolutional parity- 
check matrix has effectively the semi-infinite form of (T), and the row weights 



00:11101110 
'o' O: 1 1 1 

o"o"o';i 1 

' 0" : 1 1 
'o"o": 



of the first Ua — 1 rows are reduced. 



yields a convolutional code with parity-check matrix Hconv 
whose bi-infinite version equals the matrix shown in Figure |2] 



III. Tanner Graphs from Graph Covers 

Having formally introduced LDPC convolutional codes in 
the previous section, we now turn our attention to the main 
tool of this paper, namely graph covers. 

Definition 2 (see, e.g., ||33l ) A cover of a graph G with vertex 
set W and edge set £ is a graph G with vertex set W and 
edge set £, along with a surjection tt : W — > W which is a 
graph homomorphism (i.e., tt takes adjacent vertices of G to 
adjacent vertices of G) such that for each vertex w G W and 
each w € Tr~^{w), the neighborhood d{w) of w is mapped 
bijectively to d{w). A cover is called an M-cover, where M 
is a positive integer, if |7r~^(w)| = M for every vertex w in 

wS □ 

These graph covers will be used for the construction of 
new Tanner graphs from old Tanner graphs, in particular 
for the construction of Tanner graphs that represent LDPC 
convolutional codes. 

More specifically, this section starts by discussing two 
simple methods to specify a graph cover, which will be called 
graph-cover construction 1 (GCCl) and graph-cover con- 
struction 2 (GCC2). Although they yield isomorphic Tanner 
graphs, and therefore equivalent codes, it is convenient to have 
both methods at handO As we will see, interesting classes of 
Tanner graphs can be obtained by repeatedly applying these 
graph-cover constructions, by mixing them, and by suitably 
shortening the resulting codes. Moreover, these two graph- 
cover constructions will allow us to exhibit a connection 
between the Tanner and the JFZ unwrapping techniques. 



A. Graph-Cover Constructions 

Let A be an itla x ua matrix over Z^o- With such a matrix 
we can associate a Tanner graph T{A), where we draw ua 
variable nodes, niA check nodes, and where there are [A]j i 
edges from the i-th variable node to the j-th check nodeQ 
Given the role that the matrix A will play subsequently, we 
follow Il2n and call the matrix A a proto-matrix and the 
corresponding graph T(A) a proto-graph. 

^The number M is also known as the degree of the cover. (Not to be 
confused with the degree of a vertex.) 

''For a formal definition of code equivalence, see for example 1341 . 

'Note that we use a generalized notion of Tanner graphs, where parallel 
edges are allowed and are reflected by corresponding integer entries in the 
associated matrix. 
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The next definition introduces GCCl and GCC2, two ways 
to specify graph covers that will be used throughout the rest 
of the paper3 

Definition 3 For some positive integers ttia and ua, let A G 
^>(f /je a proto-matrix. We also introduce the following 
objects: 

• For some finite set C, let {Ai}i^c be a collection of 
matrices such that Ai G Z™^^""*, £ G £, and such that 

A = Eeec 

• For some positive integer r, let {Pe}ti£C be a collection 
of size-rxr permutation matrices. I.e., for every £ £ C, 
the matrix Pg is such that it contains one "1 " per column, 
one "1" per row, and "0"i otherwise. 

Based on the collection of matrices {Ai}£^c and the col- 
lection of matrices {Pi\i^c> there are two common ways of 
defining a graph cover of the Tanner graph T(A). (In the 
following expressions, I is the identity matrix of size r x r.) 

• Graph-cover construction I (GCCl). Consider the inter- 
mediary matrix 



B' = A®I 



tec 



{At (g> I) 



whose Tanner graph T{B') consists of r disconnected 
copies ofT(A). This is an r-fold cover o/T(A), albeit 
a rather trivial one. In order to obtain an interesting r- 
fold graph cover of A, for each £ £ C, we replace ® J 
by Ai (x) P(, i.e., we define 

B ^ ^ (A, Pf) . 

eec 

Graph-cover consti-uction 2 (GCC2) Consider the inter- 
mediary matrix 
-/ 



B 



I ® A = ^{I ® At) , 

tec 



whose Tanner graph T(i3 ) consists of r disconnected 
copies o/T(A). This is an r-fold cover o/T(A), albeit 
a rather trivial one. In order to obtain an interesting r- 
fold graph cover of A, for each I E C, we replace I®Ag 
by Pg (x) Ag, i.e., we define 

B ^ ^ (P, ® Ag) . 

iec 

If all the matrices {Pg}gec are circulant matrices, then the 
graph covers T(B) and T(B) will be called cyclic covers of 
T(A). □ 

One can verify that the two graph-cover constructions in 
Definition |3] are such that the matrix B, after a suitable 
reordering of the rows and columns, equals the matrix 

*We leave it as an exercise for tlie reader to show tliat tlie graplis constructed 
in GCCl and GCCl are indeed two instances of tlie graph cover definition 
in Definition |2] 

'indeed, a possible approach to show this is to use the fact that Ag (X) Pg 
and Pg ® Ag are permutation equivalent, i.e., there is a pair of permutation 
matrices (Q, Q') such that Ag ® Pg = Q ■ {Pg ® Ag) ■ Q'. Of course, for 
this to work, the pair {Q, Q') must be independent of £ G C, i.e., dependent 
only on the size of the matrices {Ag}ggc and {Pe}eec- Such a {Q,Q') 
pair can easily be found. 



This implies that T{B) and T{B) are isomorphic graphs; 
nevertheless, it is helpful to define both types of constructions. 

B. Graph-Cover Construction Special Cases 

The following examples will help us to better understand 
how GCCl and GCC2 can be used to obtain interesting 
classes of Tanner graphs, and, in particular, how the result- 
ing graph-cover constructions can be visualized graphically. 
Although these examples are very concrete, they are written 
such that they can be easily generalized. 



Example 4 (Cyclic cover) Consider the proto-matrix 



(5) 



with TTiA — 2 and ua ~ 3, and whose Tanner graph T(A) is 
shown in Figure ^a). Let C ^ {0, 1} x {0, 1, 2}, and let the 
collection of matrices {Ag}g£c be given by {Aj_i}j.i, where 
for each j = 0, . . . , ttia — 1 and each i = 0, . . . , ua — 1 the 
rriAxriA defined as follows 



matrix Aj^i G Z>q 



otherwise 



Moreover, let r = 7, and let the collection of matrices {Pe}ieC 
be given by {Pj,i}j,i, where Pq.o = h, Po.i = h, Po,2 = U, 
Pi,o — Iq, Pi,i — I5, Pi. 2 — I3, and where Is is an s times 
left-shifted identity matrix of size r x r. 

• Using GCCl, we obtain the matrices 



B' ^A(E)Io 



lo 
lo 



B 



Ii 



I2 
h 



(6) 



whose Tanner graphs T{B') and T{B), respectively, are 

shown in Figure^b). 

Using GCC2, we obtain the matrices 



B 







A 





























A 





























A 
















® A 













A 





























A 





























A 





























A 







Ai,o 


Ai,i 


Ao.2 


A 


1,2 


^0,1 


-4o,o 


^0,0 







Ai,o 


Ai.x 


A 


0,2 


^1,2 


^0,1 


Ao,i 


Aaa 





Ai,o 


A 


1,1 


^0,2 


Ax.2 


Ai^2 


Ao 


1 


-4o,o 





A 


1,0 


^1,1 


Ao.2 


^0,2 


Ai 


2 


Ao,i 


-4o,o 





^1,0 


Ai 


1 


^1,1 


Aa 


2 


Ax.2 


A^.i 


A 


0,0 





Al.Q 


Al,Q 


Ai 


1 


Aq.2 


Ax.2 


A 


0,1 


Aofi 








(7) 



whose Tanner graphs T{B ) and T{B), respectively, are 
shown in Figure \3j[c). Note that all the block rows and 
all the block columns sum (in TL) to A. (This observation 
holds in general, not just for this example.) □ 
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(a) Proto-graph T{A). 



(a) Proto-graph T{A). 





(b) GCCl based on T(A). Top: T(B'). Bottom: T(B). 



(b) GCCl based on T(A). Top: T(B'). Bottom: T(B). 





(c) GCC2 based on T(A). Top: T(B ). Bottom: T(B). 



(c) GCC2 based on T(A). Top: T(B ). Bottom: T(B). 



Fig. 3. Proto-graph and graph-covers for the graph-cover constructions Fig. 4. Proto-graph and graph-covers for the graph-cover constructions 
discussed in Example|4] (Compare with the con'esponding graphs in Figure|4]) discussed in Example|6] (Compare with the corresponding graphs in Figure|3]) 



We would like to add two comments with respect to the 
above example. 

First, instead of defining Is to be an s times /e/f-shifted 
identity matrix of size r x r, we could have defined Ig to be an 
s times n^/if-shifted identity matrix of size r x r. Compared 
to the matrices and graphs described above, such a change 
in definition would yield (in general) different matrices but 
isomorphic graphs. 

Second, we note that GCC2 was termed the "copy-and- 
permute" construction by Thorpe and his co-workers. This ter- 
minology stems from the visual appearance of the procedure: 
namely, in going from Figure |3j a) to Figure [3jc)(top) we copy 
the graph several times, and in going from Figure [3|c)(top) to 
Figure |3lc)(bottom) we permute the edges of the graph, where 
the permutations are done within the sets of edges that have 
the same pre-image in Figure Oa). 



Remark 5 (Quasi-cyclic codes) Consider again the matrices 
that were constructed in Example in particular the matrix 
A in Q and its r-fold cover matrix B in (|6]l. Because all 
matrices in the matrix collection {P(}iec ^""e circulant, T(B) 
represents a cyclic cover o/T(j4). Clearly, when seen over ¥2, 
B is the parity-check matrix of a quasi- 



(r) A 
QC 



the matrix iJ, 

cyclic binary linear block code 



-(r) 
-QC 



H. 



(r) 
QC 



Using the well-known isomorphism between the addition and 
multiplication of circulant matrices over F2 and the addition 



and multiplication of elements of the ring [X], 
can be written equivalently as 



this code 



^QC 



[v{X) 



H, 



(r) 
QC 



{X)-v{X)^ = 0-'} 
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with 



Using GCC2, we obtain the matrices B 



A and 



-"qc 



x^ x^ 



x^' 
x^ 



As noted above, the graphs T(JB) and T{B) that are 
constructed in Definition \3\ are isomorphic. Applying this 



(r) 
QC 



B with B 



observation to Example |4] the matrix H 
from (|7]l is therefore the parity-check matrix of a binary linear 
block code 



c, 



QC 



rr('-) -T 

Hqc-v 



°1 



that is equivalent to Cqq, i.e., the codewords of Cqq can be 

obtained from the codewords of Cqq by a suitable reorder- 
ing of the codeword components. In terms of the matrices 
{Aj^i}j^i, which also appear in the matrix B in (|7]), one 
can verify that the polynomial parity-check matrix H^^^{X) 



can be written as Hq^{X) - 
AioX^ + AooX^ + Ai.X^ 



OX" + Ao^oX' 



Ao,lX' 



□ 



Besides defining finite graph covers, we can also define 
infinite graph covers, as illustrated in the following examples. 
These infinite graph covers will be crucial towards defining 
Tanner graphs of convolutional codes. 

Example 6 (Bi-inflnite Toeplitz covers) We continue Exam- 
ple |4] However, besides keeping the proto-matrix A and 
the collection of matrices {Aj^i}j^i, we consider a different 

collection of matrices {Pj^i]j^i. Namely, we set Pq.o — Ti, 
T3 At^p At^p At^p At^p At^ 

Here Tg is a bi-infinite Toeplitz matrix with zeros everywhere 
except for ones in the s-th diagonal below the main diagonal, 
i.e., [Tsjj.i = 1 ;/ j = i + s and [Fs\j,i ~ otherwise. E.g., 



Ti 




10 
10 
10 



where for clarity we have underlined the entries of the main 
diagonal. 

• Using GCCl, we obtain the matrices B' = A®Tq and 

Ti T2 T4 

n n T3 



B 



(8) 



The Tanner graph T{B'), which is depicted in Fig- 
ure Wib)(top), is similar to the corresponding Tanner 
graph in Figure \3\b)(top), but with bi-infinitely many 
independent components. Analogously, the Tanner graph 
T(B), which is depicted in Figure^b)(bottom), is similar 
to the Tanner graph shown in Figure \3\b)(bottom), but 
instead of cyclically wrapped edge connections, the edge 
connections are infinitely continued on both sides. 



B = 



























Ao.o 























A 


A 

■^OS) 


n 


n 


n 
u 


n 
u 


n 
u 


U 


Aia 























Ai^2 


















Ai,i 


Ao,2 


Ai^2 


Aq^I 













Aifi 




Aoa 


Aia 


^0,1 















Ai,i 


^0,2 


Ai.2 


Aqa 


-4o,o 






(9) 

The Tanner graph T(B ), which is depicted in Fig- 
ure Wic)(top), is similar to the corresponding Tanner 
graph in Figure \3][c)(top), but with bi-infinitely many 
independent components. Analogously, the Tanner graph 
T(B), which is depicted in Figure^c)(bottom), is similar 
to the Tanner graph shown in Figure \3\c)(bottom), but 
instead of cyclically wrapped edge connections, the edge 
connections are infinitely continued on both sides. □ 

Although it is tempting to replace in Example |6] the bi- 
infinite Toeplitz matrices Ts (whose row and column index sets 
equal Z) by semi-infinite Toeplitz matrices (whose row and 
column index sets equal Z^o)> note that the resulting Tanner 
graphs T(S) and T(B) would then in general not be graph 
covers of T(A). This follows from the fact that semi-infinite 
Toeplitz matrices are not permutation matrices (except for To), 
and so some vertex degrees of T(B) and T(B) would not 
equal the corresponding vertex degrees in T(A)I"1 

Remark 7 It turns out that the Tanner graphs in Figure^are 
infinite graph covers of the Tanner graphs in Figure \3] More 
precisely, the Tanner graphs T{B'), T[B), T(B ), T(JB) 
in Figure |4] are graph covers of the corresponding Tanner 
graphs T(B'), T{B), T(!b'), J(B) in Figure E] For the 
Tanner graphs T(B') in Figures\3\b)(top) and^b)(top) and 
the Tanner graphs T{B ) in Figures^c)(top) and^c)(top), 
this statement is easily verified by inspection. 

To verify that the Tanner graph T{B) in Figure^c)(bottom) 
is a graph cover ofT{B) in Figure\3\c){bottom), we apply 
GCC2 with proto-matrix A, with resulting matrix B, with 
the set C, with the collection of matrices {Ae}i£c, <^nd with 
the collection of permutation matrices {Pi}e£C cis follows. 
Namely, we let the proto-matrix A be the matrix from ^ 

'"As will be clear from the discussion later on, in this paper we take an 
approach where in a first step we construct bi-infinite Tanner graphs that 
are "proper" graph covers and where in a second step we obtain semi-infinite 
Tanner graphs by applying a "shortening" procedure to these bi-infinite Tanner 
graphs. Alternatively, one could also choose an approach based on "improper" 
graph covers. Both approaches have their advantages and disadvantages; we 
prefeiTed to take the first approach. 
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(there denoted by B), we let the resulting matrix B be the 
matrix in ^ (there denoted by B), we define C = {0, 1}, we 
select 



An 





^0,1 
^0,2 






-4o,i 

^1,2 
Ao,2 







Ax,2 








^0,1 




















^1,0 A\^\ j4o,2 ^1,2 ^0,1 ^0,( 



(10) 






-4i,o 


^1,1 


-4o,2 


Ax.2 


Aa.i 


-4o.o 








Axfl 


^1,1 


-4o.2 


Ai.2 


Ao^i 











Axfl 


Ai.i 


Aa.2 


Ax.2 














-4i.o 


Ai.i 


Ao.2 

















Ai.o 






















Ax.Q 
























(11) 



and we select Pq = Tq and Pi = Ti, where T\ was defined 
in Example |6] Clearly, A = Aa + Ai (in ZjO With this we 
have 



B = Po «> Ao + Pi (g) Ai = 



Ao 
Ai Aq 
Ai Aa 
Ai Ao 



and one can verify that this matrix equals the matrix in (|9]l 
(there denoted by B), which means that T{B) is indeed an 
infinite cover of T(A). We remark that, interestingly, in this 
process we have shown how a certain GCC2 graph cover of 
a proto-matrix can be written as a GCC2 graph cover of a 
certain GCC2 graph cover of that proto-matrix. 

Finally, a similar argument shows that the Tanner graph 
T{B) in Figure^b)(top) is a graph cover of the Tanner graph 
in Figure\3][b)(top), also denoted by T{B). □ 

There are many other ways of writing a proto-matrix A as 
a sum of a collection of matrices {A^}fg£. The next example 
discusses two such possibilities. 

Example 8 Consider the proto-matrix 

^0011101110^ 
0111010011 



A 



1110 1110 
1 1 1 1 1 1 
1 1 1 1 1 1 



that is shown in Figure \5][a), and that also appeared in 
Example E] Its Tanner graph T{A) is (3, 6)-regular, i.e., all 
variable nodes have degree 3 and all check nodes have degree 
6. Let C = {0, 1}, and consider the collection of matrices 

"Note that a non-zero block diagonal of A would be put in Aq. 



{Pe}£ec with Pq ~ Tq and Pi = Ti, where the matrices Tq 
and Ti are defined as in Example^ In the following, we look 
at two different choices of the collection of matrices {A^ }fg£. 

• Figure\5\c) shows a typical part of the matrix B that is 
obtained when GCC2 is used to construct a graph cover 
of A with the collection of matrices {A^j^g^ defined as 
shown in Figure\5][a). 

• Figure \5][d) shows a typical part of the matrix B when 
GCC2 is used to construct a graph cover of A with 
the collection of matrices {Ai\g^c defined as shown in 
Figure\5\b). 

Overall, because of the choice of the collection {Pi\i^c> the 
support of both matrices B possesses a banded diagonal 
structure. Moreover, the different choices of the collection 
{Afjfgc leads to a somewhat narrower banded diagonal 
structure in the first case compared to the second case. □ 

The next example makes a crucial observation; namely, it 
shows that the above graph-cover constructions can be applied 
repeatedly to obtain additional interesting classes of Tanner 
graphs. 

Example 9 (Iterated Graph-Cover Construction) Starting 
with the proto-matrix A from Example |4] we consider two 
iterated graph-cover constructions. In the first case, we apply 
GCCl and then GCC2, and in the second case we apply 
GCC2 twice. 

• Consider the matrix B obtained from the matrix A using 
GCCl, like in Example |4] The resulting matrix B is 
shown in ^ and will be called A^^-* in this example, 
since it is considered to be a proto-matrix by itself, 
cf. Figure ^a). Based on the "cutting line" shown in 
Figure ^a), we define the matrices Aq^'* and A^^^ as 
follows: the non-zero part of Ag^^ equals the non-zero 
part of the lower triangular part of A^^^ and the non- 
zero part of A^'-p equals the non-zero part of the upper 
triangular part o/ A^^). (Clearly, A^^) = Aq^-* + A^^^j 
Applying the procedure from Example |S] Figure ^c) 
shows a typical part of the matrix B^ ■* that is obtained 
when GCC2 is used to construct a graph cover of A'^^\ 

• Consider the graph-cover B obtained from A using 
GCC2, like in Example |4] The resulting matrix B is 
shown in (|7]l and will be called A^^-* in this example, 
since it is considered to be a proto-matrix by itself, 
cf. Figure ^b). Based on the "cutting line" shown in 
Figure ^b), we define the matrices aI^-* and aI^'' as 

(2) 

follows: the non-zero part of Ag equals the non-zero 
part of the lower triangular part of A*^^^ and the non- 
zero part of aJ^'' equals the non-zero part of the upper 



triangular part of A^"^^ . (Clearly, A^^^ = A, 



(2) 



l(2) 



Applying the procedure from Example |S] Figure [dfc/j 

(2) 

shows a typical part of the matrix B that is obtained 
when GCC2 is used to construct a graph cover of A^^\ 

We observe a large difference in the positions of the non- 
zero entries in B^^'^ and 

• In the first case, the two graph-cover constructions are 
"incompatible " and the positions of the non-zero entries 
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(a) First decomposition of tlie matrix A into tlie matrices Aq and Ai. 



(b) Second decomposition of tlie matrix A into the matrices Ao and Ai. 
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(c) Part of the matrix B based on the first decomposition of A. 



Fig. 5. Matrices appearing in Example [8] (See main text for details.) 



Ai 



(d) Part of the matrix B based on the second decomposition of A. 



in B follow a "non-simple" or "pseudo-random" 
pattern. As we will see in Example [7S| with the help 
of simulation results, such Tanner graphs can lead to 
time-varying LDPC convolutional codes with very good 
performance. 

In the second case, the two graph-cover constructions 
are "compatible " in the sense that B can be obtained 
from the proto-matrix A by applying GCC2 with suitable 
matrix collections {Ai\iizc and {Pi}ei£C- such, the 

(2) 

positions of the non-zero entries of B follow a rela- 
tively "simple" or "non-random" pattern, which leads to 
a time-invariant LDPC convolutional code. □ 



The above procedure of obtaining two matrices that add up 
to a matrix is called "cutting a matrix". Actually, we will also 
use this term if there is no simple cutting line, as in the above 
examples, and also if the matrix is written as the sum of more 
than two matrices (cf. Example [T] and the paragraphs after it). 



C. Revisiting the Tanner and the JFZ Unwrapping Techniques 

In Section III-CI we introduced two techniques, termed 
the Tanner and the JFZ unwrapping techniques, to derive 
convolutional codes from block codes. In this subsection we 
revisit these unwrapping techniques. In particular, we show 
how they can be cast in terms of graph covers and how the 
two unwrapping techniques are connected. 

Because of the importance of the coding-theoretic notion 
of shortening ll34l for this subsection, we briefly revisit this 
concept. Let be a parity-check matrix that defines some 
length-n binary code C. We say that the length- (71 — 1) code 
C is obtained by shortening C at position i if 



C'=|(vo, . 



In terms of parity-check matrices, a possible parity-check 
matrix H' of C is obtained by deleting the i-th column of 
H. In terms of Tanner graphs, this means that the Tanner 
graph T{H') is obtained from T(i?) by removing the i-th 
variable node, along with its incident edges. In the following. 
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(a) Matrix A^'^^ 



(b) Matrix 




(c) Part of matrix B 



Part of matrix B 



Fig. 6. Matrices appearing in Example [9] (See main text for details.) 



we will also use the term "shortening" to denote this graph 
modification procedure. 

Now, to explain the Tanner unwrapping technique in terms 

(r) 

of graph covers, consider the quasi-cyclic block code Cq^ 

S(^) of 



defined by the polynomial parity-check matrix if, 
size niA x nA, i-e.. 



r('-) _ 

^QC ~ 



.{D)-v{D)^ = 0''} 



with polynomial parity-check matrix 



-ffconv(-D) = Hq^{X) 



X=D 



Again, the change of indeterminate from X Xo D indicates the 
lack of modulo D"^ — 1 operations. 

In the following we will give, with the help of an example, 
two interpretations of the Tanner unwrapping technique in 
terms of graph covers. 



(r) 

Example 10 Unwrapping the quasi-cyclic block code Cq^ 
that was considered in Remark\5\ we obtain a rate-l/'S time- 
invariant convolutional code 

Cconv = [v{D) e F2[i?]"^ I H,,UD) ■ v{DY = qt} 



with polynomial parity-check matrix 



He, 



AD) 



Consider now the infinite graph covers that were constructed 
in Example^using GCCl, in particular T(B). Let C(T(J5)) 
be the set of codewords defined by the Tanner graph T(B). 
Then the convolutional code Cconv is a shortened version of 
C(T(B)) where all codeword bits corresponding to negative 
time indices have been shortened. Therefore, the Tanner graph 
o/ Cconv is given by the Tanner graph in Figure^b)(bottom), 
where all bit nodes with negative time indices, along with their 
incident edges, are removed. Clearly, this bit-node and edge 
removal process implies decreasing the degrees of some check 
nodes. In fact, some check nodes become obsolete, because 
their degree is decreased to zero. □ 

Therefore, one interpretation of the Tanner unwrapping 
technique in terms of graph covers is that the Tanner graph of 
the convolutional code is obtained by taking a suitable graph 
cover of the same proto-graph that was used to construct the 
quasi-cyclic LDPC code, along with some suitable shortening. 

Example 11 We continue Remark\5\and Example \10\ Clearly, 
in the same way as the block code Cqq is equivalent to the 

(r) — 

block code Cqq, we can define a code Cconv ( with parity-check 
matrix Hconv) that is equivalent to Cconv The observations in 
Remark [7| and Example \TU\ can then be used to show that the 
Tanner graph of Hconv equals a graph cover of the Tanner 



(r) 

graph Hqq, along with some suitable shortening. 



□ 



where the polynomial operations are performed modulo X"^ — ! 
(see also Remark |5]l. As already mentioned in Section III-CI 
the Tanner unwrapping technique is simply based on dropping 
these modulo computations. More precisely, with a quasi- 

(r) i 

cyclic block code Cq^, we associate the convolutional code tj 

Cconv = {t^P)eF2[i?]" 



Therefore, the second interpretation of the Tanner unwrap- 
ping in terms of graph covers is that the Tanner graph of the 
convolutional code is obtained by taking a suitable graph cover 
of the Tanner graph of the quasi-cyclic code, along with some 
suitable shortening. 

Now turning our attention to the JFZ unwrapping technique, 
recall from Section III-CI that this method is based on writing 
a parity-check matrix H of some block code C as the sum 
H = '^(^cHg (in Z) of a collection of matrices {Hg}^^^. 
The convolutional code is then defined to be 



Hr 



(12) 



where 



Hr 



Ho 
Hi 



Ho 



H\c\-i H\c\-2 ■ ■ ■ Hq 

H\C\-1 H\c\~2 ■■■ Hq 



(13) 



With the help of an example, we now explain how the JFZ 
unwrapping technique can be cast in terms of graph-covers. 

Example 12 Consider the infinite graph covers that were 
constructed using GCC2 in Example |^ in particular T{B). 
Let C(T(B)) be the set of valid assignments to the Tanner 
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graph T{B). Moreover, let H = Hq + Hi ^ \- = 

+ Aofi + AoA + ^1,2 + ^0,2 + ^1,1 + ^1,0, and let Cconv 
be defined as in (1121) . Then the code Cconv is a shortened 
version of C(T(B)), where all codeword bits corresponding 
to negative time indices have been shortened. Therefore, the 
Tanner graph of Cconv is given by the Tanner graph in 
Figure^c)(bottom), where all the bit nodes with negative time 
indices are shortened. D 

In order to connect the unwrapping techniques due to Tanner 
and due to JFZ, we show now, with the help of an example, 
that in fact the unwrapping technique due to Tanner can be 
seen as a special case of the unwrapping technique due to 
JFZ0 

Example 13 Consider the cjuasi-cyclic block code defined by 
the parity-check matrix Hqq = B, where B was defined 
in (|7]l. Applying the JFZ unwrapping technique with the matrix 
decomposition Hqq = Aq + Ai (in Z), with Aq defined 
in ilOi and Ai defined in (II 11 1. Hconv turns out to equal a 
submatrix of B in (|9]), namely the submatrix of B where the 
row and column index set are equal to Z^q. However, the 
code defined by Hconv is equivalent to the code defined by 
the Tanner unwrapping technique applied to the quasi-cyclic 
code defined by Hqq. □ 

Therefore, the unwrapping technique due to JFZ is more 
general. In fact, whereas the Tanner unwrapping technique 
leads to time-invariant convolutional codes, the unwrapping 
technique due to JFZ can, depending on the parity-check 
matrix decomposition and the internal structure of the terms 
in the decomposition, lead to time-varying convolutional codes 
with non-trivial period 

Despite the fact that the unwrapping technique due to Tanner 
is a special case of the unwrapping technique due to JFZ, it 
is nevertheless helpful to have both unwrapping techniques 
at hand, because sometimes one framework can be more 
convenient than the other. We will use both perspectives in 
the next section. 

We conclude this section with the following remarks. 

• Although most of the examples in this section have reg- 
ular bit node degree 2 and regular check node degree 3, 
there is nothing special about this choice of bit and check 
node degrees; any other choice would work equally well. 

• Although all polynomial parity-check matrices that ap- 
pear in this section contain only monomials, this is not re- 
quired, i.e., the developments in this section work equally 
well for polynomial parity-check matrices containing the 
zero polynomial, monomials, binomials, trinomials, and 
so on. 

• It can easily be verified that if the matrix A in Defini- 
tion [3] contains only zeros and ones, then the graph covers 
constructed in GCCl and GCC2 never have parallel 

'^We leave it as an exercise for the reader to sliow the validity of this 
connection beyond this specific example. 

'^Of course, if the underlying quasi-cyclic block code is suitably chosen, 
then also the Tanner unwrapping technique can yield a time-varying convo- 
lutional code; however, we do not consider this option here. 



edges. In particular, if A is the parity-check matrix of 
a block code (like in most examples in this paper), then 
the constructed graph covers never have parallel edges. 
However, if A contains entries that are larger than one, 
then there is the potential for the constructed graph covers 
to have parallel edges; if parallel edges really appear 
depends then critically on the choice of the decomposition 
A = J2eec ^) ^^'^ choice of the permutation 

matrices {Pt}e£c- An example of such a case is the Tan- 
ner graph construction in Section IV-CI where A = [33] 
and where {A^}^g£ and {Pt}eec chosen such that 
parallel edges are avoided in the constructed graph cover 
We note that in the case of iterated graph-cover con- 
structions it can make sense to have parallel edges 
in the intermediate graph covers. However, in the last 
graph-cover construction stage, parallel edges are usually 
avoided, because parallel edges in Tanner graphs typically 
lead to a weakening of the code and/or of the message- 
passing iterative decoder. 

IV. Graph-Cover Based Constructions of 
LDPC Convolutional Codes 

Although the graph-cover constructions and unwrapping 
techniques that were discussed in Sections HH and Hill are math- 
ematically quite straightforward, it is important to understand 
how they can be applied to obtain LDPC convolutional codes 
with good performance and attractive encoder and decoder 
architectures. To that end, this section explores a variety of 
code design options and comments on some practical issues. 
It also proposes a new "random" unwrapping technique which 
leads to convolutional codes whose performance compares 
favorably to other codes with the same parameters. Of course, 
other variations than the ones presented here are possible, 
in particular, by suitably combining some of the example 
constructions. 

The simulation results for the codes in this section plot 
the decoded bit error rate (BER) versus the signal-to-noise 
ratio (SNR) E\j/No and were obtained by assuming BPSK 
modulation and an additive white Gaussian noise channel 
(AWGNC). All decoders were based on the sum-product 
algorithm 1351 and were allowed a maximum of 100 itera- 
tions, with the block code decoders employing a syndrome- 
check based stopping rule. For comparing the performance of 
unwrapped convolutional codes with their underlying block 
codes we will use the following metric. 

Definition 14 For a convolutional code constructed from an 
underlying block code, we define its "convolutional gain " to 
be the difference in SNR required to achieve a particular BER 
with the convolutional code compared to achieving the same 
BER with the block code. □ 

The rest of this section is structured as follows. First we 
discuss the construction of some time-invariant LDPC con- 
volutional codes based on the Tanner unwrapping technique. 
In this context we make a simple observation about how the 
syndrome former memory can sometimes be reduced without 
changing the convolutional code. Secondly, we present a 
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i"l 55,64] QC code 
[240,98] QC code 
[400,162] QC code 

R=2/5 time-invariant conv. code with Vg=145 
R=2/5 time-varying conv. code with Vg=155 
iR=2/5 time-varying conv. code with Vg=240 
;R=2/5 time-varying conv. code with Vg=400 



Eb/N„ (dB) 



Fig. 7. Performance of three (3,5)-regular quasi-cyclic LDPC blocif codes and 
their associated time-invariant and time-varying LDPC convolutional codes. 
(Note that the small gaps that appear between the second, third, and fourth 
curves for high signal-to-noise ratios are caused by a slight difference in code 
rates due to the existence of redundant rows in the block code parity-checlt 
matrices.) 



construction of time-varying LDPC convolutional codes based 
on iterated graph-cover constructions. An important sub-topic 
here will be an investigation of the influence of the "diagonal 
cut" (which is used to define a graph cover) on the decoding 
performance. 

A. Construction of Time-Invariant LDPC Convolutional 
Codes Based on the Tanner Unwrapping Technique 

In this section we revisit a class of quasi-cyclic LDPC 
codes and their associated convolutional codes that was studied 
in ll36l . As we will see, they are instances of the quasi- 
cyclic code construction in Example |4] and Remark |5] and 
the corresponding convolutional code construction based on 
Tanner's unwrapping technique in Example [TOl 

Example 15 Consider the regular proto-matrix 



(14) 



with ruA = 3 and ua — 5. We apply GCCl, as in Example |4] 
and Remark \5\ with an interesting choice of permutation 
matrices first suggested by Tanner l[37]l that yields the parity- 
check matrix 



H. 



(r) 
QC 



h 


h 


h 


Ts 


I16 


h 


Iio 






hs 


/25 


Il9 


I7 




I28 



(15) 



where as before Ig is an s times left-circularly shifted identity 
matrix of size rxr and r > 28. The corresponding polynomial 
parity-check is 

'X^ X^ 

X5 ^10 ^20 ^9 ^18 
^25 ^19 X' 



The resulting quasi-cyclic (3, b)-regular LDPC block codes 
have block length n = 5 ■ r. In particular, for ?' = 31, r = 48, 



and r = 80, we obtain codes of length 155, 240, and 400, 
respectively, whose simulated BER performance results are 
shown in Figure^ The choice r = 31 yields the well-known 
length-155 quasi-cyclic block code that was first introduced 
by Tanner l[37]l (see also the discussion in MS'i ). 

Unwrapping these codes by the Tanner unwrapping tech- 
nique as in Example \10\ we obtain a rate-2/b time-invariant 
convolutional code with = 145 defined by the polynomial 
parity-check matrix 





















^20 








^25 


^19 






JJ2S 



Its decoding performance is also shown in Figure [7| under the 
label "R = 2/5 time-invariant conv. code with i^g = 145." We 
conclude this example with a few remarks. 

« Figure [7| shows that the convolutional code exhibits a 
"convolutional gain" of between 0.5 dB and 0.7 dB 
compared to the [155, 64] quasi-cyclic LDPC block code 
at moderate BERs and that the gain remains between 

0. 15 dB and 0.3 dB at lower BERs. 

« Note that the polynomial parity-check matrix HcomiD) 
that is obtained by the Tanner unwrapping technique is 
independent of the parameter r of the polynomial parity- 
check matrix Hqq[X), as long as r is strictly larger than 
the largest exponent appearing in H^'q{X). Moreover, 

for r — >■ 00, the Tanner graph of Hq^{X) is closely 
related to the Tanner graph of Hcom{D), and so it is 
not surprising to see that, for larger r, the decoding 
performance of quasi-cyclic LDPC block codes based on 
Hq^{X) tends to the decoding performance of the LDPC 
convolutional based on Hconv{D), as illustrated by the 
two curves labeled "[240, 98] QC code" and "[400, 162] 
QC code" in Figure^ 
• The permutation matrices (more precisely, the circulant 
matrices) that were used for constructing the quasi-cyclic 
codes in this example were not chosen to optimize the 
Hamming distance or the pseudo-weight properties of 
the code. In particular, a different choice of circulant 
matrices may result in better high-SNR performance, 

1. e., in the so-called "error fioor" region of the BER 
curve. For choices of codes with better Hamming distance 
properties, we refer the reader to h38^ . 

« The remaining curves in Figure [7| will be discussed in 
Example [T^ □ 

We conclude this subsection with some comments on the 
syndrome former memory ttt-s of the convolutional codes 
obtained by the Tanner unwrapping technique, in particular 
how this syndrome former memory rris can sometimes be 
reduced without changing the convolutional code. 

Assume that we have obtained a polynomial parity-check 
matrix HconviD) from Hq^{X) according to the Tanner 
method. Clearly, the syndrome former memory mg is given 
by the largest exponent that appears in HconviD). In some 
instances there is a simple way of reducing mg without 
changing the convolutional code. Namely, if e is the minimal 
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exponent that appears in the polynomials of a given row of 
Hconv{D), then the polynomials in this row of Hcom{D) can 
be divided by D"^. We illustrate this syndrome former memory 
reduction for the small convolutional code that appeared in 
Example [TO] 

Example 16 Applying the Tanner unwrapping technique to 
the polynomial parity-check matrix Hq^{X) of the quasi- 
cyclic LDPC code with r ~ 7 in Remark |5] we obtain 
Hconv{D) of a rate-1/3 time-invariant LDPC convolutional 
code, as shown in Example\l^ with syndrome former memory 
nis = 6. Following the procedure discussed in the paragraph 
above, the first and second rows of Hconv{D) can be divided 
by and D^, respectively, to yield an equivalent convo- 
lutional code with syndrome former memory nis = 3 and 
polynomial parity-check matrix 

^3 jj2 jjO ■ 



(16) 



Figure^shows parts of the corresponding scalar parity-check 
matrix Hconv for mg — 3, together with the original scalar 
parity-check matrix for rrig ~ 6, and illustrates the equivalence 
of the two matrices in the sense that only the ordering of 
the rows is different, which does not affect the corresponding 
convolutional code. In this example, the order of the even- 
numbered rows stays the same, while the odd-numbered rows 
are shifted by four positions. The equivalence of the two parity- 
check matrices can be seen by noting that the parity-check 
matrix, outside of the diagonal structure, is filled with zeros. 

□ 



B. Construction of Time-Varying LDPC Convolutional Codes 
Based on Iterated Graph-Cover Constructions 

As was seen in Example |9] interesting graph covers can 
be obtained by combining GCCl with GCC2, or vice-versa. 
Inspired by that example, this subsection considers iterated 
graph-cover constructions for constructing Tanner graphs of 
LDPC convolutional codes, in particular of time-varying 
LDPC convolutional codes. 

Definition 17 Based on a combination of GCCl, GCC2, and 

the code-shortening concept introduced in Section 17/7- CI we 
propose the following construction of LDPC convolutional 
codes. 

1) We start with a proto-matrix A of size niA x ha- 

2) We apply GCCl to A with finite-size permutation ma- 
trices and obtain the matrix A'. 

3) We apply GCCl to A' with permutation matrices that 
are bi-infinite Toeplitz matrices and obtain the matrix 
A". 

4) Finally, looking at A" as the parity-check matrix of a bi- 
infinite convolutional code, we obtain the parity-check 
matrix of a convolutional code by shortening the code 
bit positions corresponding to negative time indices. 

Here, Steps\3\and^can be seen as an application of the JFZ 
unwrapping method. □ 



000000001000010100000 
►1 00010000001000000000 

000000001000010100000 
100010000001000000000 

000000001 000010100000 
100010000001 000000000 

000000001 000010100000 
100010000001 000000000 

(a) 

001000010100 
'1 00010000001 

001000010100 
100010000001 
000000001000010100 00 
100010000001 

001000010100 
100010000001 



(b) 



Fig. 8. Parts of the scalar parity-check matrices (see 0) corresponding to the 
two equivalent LDPC convolutional codes with syndrome former memories 
(a) nis = 6 and (b) rrig = 3. 



The following example shows how this construction can be 
applied to obtain LDPC convolutional codes with excellent 
performance. (In the example, where suitable, we will refer to 
the analogous matrices of Example |9] and Figure |6] that were 
used to illustrate the iterated graph-cover construction.) 

Example 18 Based on Definition [73 we construct an LDPC 
convolutional code by performing the following steps. 

1) We start with the same regular proto-matrix A as in 
Example [75] for which uia = 3 and ua = 5. 

2) We apply GCCl to A with permutation matrices that are 
circulant matrices of size r x r and obtain the parity- 
check matrix A' ~ -H^qc shown in ( I15l l, which is the 
analogue o/ A^^-* in Figure^a). 

3) We apply GCC2 to A' — Hqq with permutation 
matrices that are bi-infinite Toeplitz matrices and obtain 
a new parity-check matrix A" . This is analogous to the 
transition of the matrix A*-^' in Figure^a) to the matrix 

in Figure^c). The "diagonal cut" is obtained by 
alternately moving ua — 5 units to the right and then 
•niA = 3 units down. 

4) Finally, we obtain the desired convolutional code by 
shortening the code bit positions corresponding to neg- 
ative time indices. 

For the choices r = 31, 48, 80, this construction results 
in rafe-2/5 time-varying convolutional codes with syndrome 
former memory mg = 30, 47, 79, respectively, and with 
constraint length Vg — {nis + 1) ■ ^a = 155, 240, 400, respec- 
tively. The label "time-varying" is indeed justified because the 
convolutional codes constructed here can be expressed in the 
form of the parity-check matrix in ([TJ with a suitable choice 
of syndrome former memory nis, non-trivial period Tg, and 
submatrices 

The decoding performance of these codes is shown in 
Figure [2 labeled "R = 2/5 time-varying com. code with 
Vs = . . .". As originally noted in 4391/ . we observe that these 
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three LDPC convolutional codes achieve significantly better 
performance at a BER of 10~® than the other codes shown 
in this plot, namely with " convolutional gains" of 2.0 dB for 
the Vs — 155 convolutional code, 2.4 dB for the Vs = 240 
convolutional code, and 2.8 dB /or the Vg = 400 convolutional 
code, compared to the three respective underlying LDPC block 
codes. 

In order to compare these codes based on a given decoding 
processor (hardware) complexity, we consider a block code 
of length n = Vs (see H40^ and 114 The above time- 
varying convolutional code for r = 31 has constraint length 
Vs = (jTi-s + 1) ■ c = 155, and hence approximately the same 
processor complexity as the quasi-cyclic block code of length 
n = 155 in Figure [7| and the time-invariant convolutional 
code with Vs = 145 in Figure but it achieves large gains 
compared to both of these codes. We note, in addition, that 
the performance of the time-varying convolutional code with 
constraint length Vs = 400 is quite remarkable, since, at a 
BER o/10~^, it performs within 1 dB of the iterative decoding 
threshold of 0.965 dB, while having the same processor 
complexity as a block code of length only n = 400. In Section 
\VI-C\ we discuss some possible reasons for these "convolu- 
tional gains," along with their associated implementation costs 
in terms of decoder memory and decoding delay. □ 

We make the following observations with respect to the 
above definition and example. 

• The LDPC code construction in the above example yields 
time-varying LDPC convolutional codes with syndrome 
former memory ms ^ r — 1 and period Tg = r. 
Most importantly, varying r in the above construction 
leads to different LDPC convolutional codes. This is in 
contrast to the Tanner unwrapping technique discussed 
in Section ITV-AI where the obtained LDPC convolutional 
code is independent of the parameter r, as long as r is 
strictly larger than the largest exponent in Hq^{X). 

• As mentioned previously in Example |9] the iterated 
graph-cover construction based on the combination of 
GCCl and GCC2 yields Tanner graphs that have a 
"pseudo-random" structure, a structure that seems to be 
beneficial as indicated by the above simulation results. 
(We remark that the improved performance of the time- 
varying LDPC convolutional codes obtained by unwrap- 
ping a randomly constructed LDPC block code was first 
noted by Lentmaier et al. in B2l .) 

• Instead of constructing a first parity-check matrix as in 
Step |2] of Definition [17] one can also start with any 
other (randomly or non-randomly constructed, regular 
or irregular) parity-check matrix, and still achieve a 
"convolutional gain." The next example is an illustration 
of this point. 

Example 19 As was done in l[41]l . one can replace the parity- 
check matrix that was constructed in Step |2] of Definition \17\ 
by an irregular LDPC block code with optimized iterative 
decoding thresholds. In particular, one can start with the 
parity-check matrix of the rate-1/2 irregular proto-graph- 
based code from l\43]l with an iterative decoding threshold of 
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Fig. 9. Performance of a family of irregulai' proto-graph-based LDPC block 
codes and the associated time-varying LDPC convolutional codes. 

0.63 dB, and several of its punctured versions. Figure\9\shows 
simulation results for the obtained block and convolutional 
codes. Each simulated block code had a block length of about 
2500, with code rates ranging from 1/2 to 4/5. We see that 
"convolutional gains" ranging from 0.6 dB to 0.9 dB at a 
BER of 10~^ were obtained. 

Similarly, it was shown in l[24]I that an LDPC convolutional 
code derived from a randomly constructed rate-1/2 irregular 
LDPC block code with block length 2400 outperformed the 
underlying code by almost 0.8 dB at a BER of 10^^. The 
degree distribution of the underlying LDPC block code was 
fully optimized and had an iterative decoding threshold of 
0.3104 dB SB- □ 

Of course, there are other ways of applying the "diagonal 
cut" in Step[3]of Example [TSl and so it is natural to investigate 
the influence of different "diagonal cuts" on the decoding 
performance. We will do this in the next few paragraphs 
by extending the discussion that was presented right after 
Example [T] 

We start by assuming that the matrix after Step |2] of 
Definition [TT] has size m x n, and define rj = god (to, n). Then, 
for any positive integer i that divides r/, we can perform a 
"diagonal cut" where we alternately move c' = I - {n/rj) units 
to the right and then c' — b' = £ ■ {m/rj) units down (i.e., 
b' = £ ■ (y{n — m)/r]). With this, the obtained convolutional 
code is a periodically time-varying LDPC convolutional code 
with rate R' = b' /c' ^b/c, syndrome former memory m'^ ~ 
[n/c') -1 = {rj/i) - 1, period = + 1 = n/c' = ri/£, 
and constraint length v'^ = c' ■ (to^ + 1) = n. (Note that the 
syndrome former memory depends on I, but the constraint 
length vi. is independent of I.) 

Example 20 Here we simulate the performance of some 
LDPC convolutional codes obtained according to the above 
generalization of the "diagonal cut." Namely, we start with a 
randomly-constructed (3, 6)-regular LDPC block code based 
on a parity-check matrix of size 1024 x 2048. Therefore 
TO = 1024, n = 2048, and -q = gcd{m, n) = 1024. (Note 
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Fig. 10. Performance of a family of LDPC convolutional codes obtained 
from a (3, 6)-regular LDPC block code using different step sizes. 

that d = i ■ {n/rf) = 2£ and b' = £ ■ {{n - m)/ij) = £ in 
this case.) Figure [70| shows the performance of the resulting 
family of LDPC convolutional codes, where £ varies in powers 
of 2 from 1 to 1024, each with constraint length v'^ = 2048. 
We make the following observations. First, the case I = 1024 
is not interesting because it results in m'^ = 0, i.e., it is a 
trivial concatenation of copies of the block code, and so the 
BER is the same as for the underlying block code. Secondly, 
for all other choices of £, the constructed codes perform 
very similarly, each exhibiting a sizable "convolutional gain " 
compared to the block code, although the syndrome former 
memory m'^ is different in each case. D 

A special case of the above code construction deserves 
mention. When 77 = 1, i.e., m and n are relatively prime, 
the only possible step size is obtained by choosing £ = 77 = 1, 
which results in the above-mentioned uninteresting case of 
trivial concatenations of copies of the block code. However, 
all-zero columns can be inserted in the parity-check matrix 
such that a value of 77 > 1 is obtained, which allows a step 
size to be chosen that results in a convolutional code with 
m'g > 0. The variable nodes corresponding to the all-zero 
columns are not transmitted, i.e., they are punctured, so that 
the rate corresponds to the size of the original parity-check 
matrix. 

For the "diagonal cut" LDPC convolutional code construc- 
tions discussed above, the unwrapped convolutional codes 
have the minimum possible constraint length I'l, which is equal 
to the block length of the underlying block code. Although this 
is a desirable property for practical implementation, we do not 
need to limit ourselves to diagonal cuts in general. 

Inspired by the graph-cover construction of Figures |5jb) 
and |3d) in Example [8] instead of a "diagonal cut" we now 
consider a "random cut," which we define as a partition of 
the parity-check matrix into two matrices that add up (over 
Z) to the parity-check matrix. Despite the randomness of this 
approach, several of the key unwrapping properties of the 
"diagonal cut" are preserved. For example, the computational 
complexity per decoded bit does not change, since the degree 



Fig. 1 1 . Performance of "randomly unwrapped" LDPC convolutional codes 
obtained from a (3, 6)-regular LDPC block code using random partitions. 

distributions of the resulting codes are all equal0 However, 
the LDPC convolutional codes based on a "random cut" 
typically require larger decoding processor sizes as a result 
of increased code constraint lengths. 

Example 21 We continue Example \20\ however, instead of 
performing "diagonal cuts," we perform "random cuts." Fig- 
ure\T7\shows the performance of five such LDPC convolutional 
codes, each with rate 1/2 and constraint length v'^ = 4096, 
compared to the underlying block code and the LDPC con- 
volutional code constructed in Example |20| ( with parameters 
£ = 1, 6' = 1, c' = 2, and v'^ = 2048 j. We note that the 
increase in constraint length from v'^ ~ 2048 to v!. = 4096 
due to the "random cut" results in a small additional coding 
gain in exchange for the larger decoding processor size. D 

Finally, we note that, for a size m x n sparse parity-check 
matrix H with p nonzero entries, there are a total of 2™" 
possible ways of choosing a random cut. However, due to 
the sparsity, there are only 2^ distinct random cuts, where 
p m ■ n. 

V. Connections to Other LDPC Codes Based on 
Graph-Cover Constructions 

In this section we briefly discuss some other graph-cover- 
based LDPC code constructions proposed in the literature, 
namely by Ivkovic et al. IS, by Divsalar et al. ED, ED, by 
Lentmaier et al. 1461, 113, and by Kudekar et al. BSll . 

A. LDPC Code Construction by Ivkovic et al. 

The LDPC code construction by Ivkovic et al. in 1441 can 
be seen as an application of the graph-cover construction in 
Figures |3b) and |3d) in Example [8] Namely, in terms of our 
notation, Ivkovic et al. 1441 start with a parity-check matrix H, 
choose the set C ~ {0, 1}, a collection of zero-one matrices 

'■*This is guaranteed by choosing a random partition of the block code 
parity-check matrix and then using this partition to construct one period of 
the time-varying convolutional code parity-check matrix. 
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{Hq, Hi} such that H ~ Hq + Hi (in Z), and the collection 
of permutation matrices {Pq, Pi}eec = {[oi]'[io]} - Most 
importantly, the decomposition of H into Hq and Hi is done 
such that trapping sets that were present in the Tanner graph 
of H are not present in the Tanner graph of the new parity- 
check matrix. In addition, Ivkovic et al. give guarantees on 
the relationship between the minimum Hamming distances of 
the old and new codeF^ 



B. LDPC Code Construction by Divsalar et al. 

One of the LDPC code constructions by Divsalar et al. 
in ED, ED is the so-called rate-1/2 AR4JA LDPC code 
construction, which was also considered earlier in Example [T9] 
A particularly attractive, from an implementation perspective, 
version of this code construction is obtained by an iterated 
graph-cover construction procedure, where each graph-cover 
construction is based on a cyclic cover, as in the application 
of GCCl in Example |4] Although cyclic covers result in 
simplified encoding and decoding circuitry, codes based on 
cyclic covers are known to have the disadvantage that the 
minimum Hamming distance is upper bounded by a number 
that is a function of the proto-graph structure P9l , ifSOl . 
However, because the cyclic cover of a cyclic cover of the 
proto-graph is not necessarily a cyclic cover of the proto- 
graph, such disadvantages are avoided to a certain extent in the 
AR4JA LDPC code construction. Nevertheless, ultimately the 
minimum Hamming distance of such codes will also be upper 
bounded by some number; however, these bounds usually 
become relevant only beyond the code length of interest0 



C. LDPC Code Construction by Lentmaier et al. and Kudekar 
et al. 

The LDPC code constructions by Lentmaier et al. P6l , pTl 
and Kudekar et al. 148] can also be seen as iterated graph- 
cover constructions. We now describe a specific instance of 
this construction. 

> It starts with a proto-matrix A ^ [ ^ 3 ] . 

• The first graph-cover construction is very similar to the 
bi-infinite graph-cover construction in Example |6] and 
Figure |4] Namely, in terms of our notation, we define 
the set C = {0,1,2,3,4,5}, the collection of matrices 
{Aiji^c with Ao ^ Al = A2 ^ [10] and A3 
Ai = A5 = [01], and the collection of permutation 
matrices {Pi}i^c with Po = Tq, Pi = Ti, P2 = T2, 
P3 = To, P4 ^Ti, P5 = T2, where as before Ts is a 
bi-infinite Toeplitz matrix with zeros everywhere except 
for ones in the s-th diagonal below the main diagonal. 

• The second graph-cover construction is a random graph- 
cover construction of cover-degree M. 

• The code is shortened. Namely, for some positive integer 
L all codeword indices corresponding to values outside 

'^See also the discussion of similar results in 1491 Appendix J]. 

'*For this statement we assume that the degree of the first cover is fixed. 



the range [—LA/, LM] are shortened^ 
We now point out some differences between this code con- 
struction and the LDPC convolutional code construction in 
Definition [17] Namely, the LDPC code ensemble constructed 
above has the following properties. 

> The first graph-cover construction is based on bi-infinite 
Toeplitz permutation matrices, and the second graph- 
cover construction is based on finite-size permutation 
matrices. 

« The analysis focuses on the case where M and L go to 
infinity (in that order), i.e., for a fixed L the parameter 
M tends to infinity. Afterwards, L tends to infinity. 

> The number of check nodes with degree smaller than 6 
in the Tanner graph is proportional to Al . 

• In |f48l, for the binary erasure channel, when M and L 
go to infinity (in that order), Kudekar et al. prove that 
the sum-product algorithm decoding threshold for a slight 
variation of the above-mentioned ensemble of codes 
equals the maximum a-posteriori decoding threshold for 
the ensemble of (3, 6)-regular LDPC codes. This is a very 
remarkable property! (In lISTl . using density evolution 
methods, Lentmaier et al. give numerical evidence that 
this statement might also hold for binary-input output- 
symmetric channels beyond the binary erasure channel.) 

On the other hand, the codes constructed in Definition [TTl have 
the following properties. (We assume that the underlying block 
code is a (3, 6)-regular LDPC code.) 

> The first graph-cover construction is based on finite- 
size permutation matrices, and the second graph-cover 
construction is based on bi-infinite Toeplitz permutation 
matrices. 

• In a typical application of this construction, r is fixed. 

• The number of check nodes with degree smaller than 6 
in the Tanner graph of the LDPC convolutional code is 
proportional to r. 

> For a binary-input output-symmetric channel, the per- 
formance of the unterminated LDPC convolutional code 
under the continuous sliding window sum-product algo- 
rithm decoding discussed in Section III-BI improves with 
increasing r (see, e.g., Fig.|7]i, but the ultimate asymptotic 
threshold of such unterminated decoding is unknown^ 

The differences between these two code families come mainly 
from the fact that the codes constructed by Lentmaier et 
al. and Kudekar et al. are essentially block codes, although 
sophisticated ones, whereas the codes in Definition [17] are 
convolutional codes, along with their advantages and disad- 
vantages. In particular, the way the limits of the parameters 

'^Although this code construction method could be presented such that the 
shortening is done between the two graph-cover construction steps, namely 
by shortening all codeword indices that correspond to values outside the 
range [— L, L], we have opted to present the code construction such that 
the shortening is done after the two graph-cover construction steps. In this 
way, the structure of the code construction description matches better the 
description in Definition 1171 

'^Lentmaier et al. have shown in 1461 and 1471 that properly terminated 
LDPC convolutional codes become equivalent to the LDPC block codes 
constructed by Kudekar et al. in |48| and inherit their excellent asymptotic 
threshold properties, but whether this is true for unterminated LDPC convo- 
lutional codes is still an open question. 
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are taken, there is a significant difference in the fraction of 
check nodes with degree strictly smaller than 6. Namely, in 
the case of the codes by Lentmaier et al. and Kudekar et 
al. this fraction is a fixed non-zero function of L (here we 
assume fixed L and M — > oo), whereas in the case of the 
codes considered in this paper, this fraction is zero (here we 
assume fixed r and an unterminated convolutional code). 

We conclude this section with the following remarks. 
Namely, although the convolutional codes in Definition [17] 
may not enjoy the same asymptotic thresholds as the block 
code constructions by Lentmaier et al. and by Kudekar et al, 
they lend themselves to a continuous decoding architecture, 
as described in Section III-BI which can be advantageous 
in certain applications, such as data streaming, without a 
predetermined frame structure. More importantly, however, 
it is very encouraging that the simulation results reported 
in this paper indicate that sizable "convolutional gains" are 
already visible for very reasonable constraint/code lengths. In 
the next section we discuss some possible reasons for these 
gains. Finally, it is worth noting that, as the block lengths and 
associated constraint lengths of the constructions presented in 
this section become larger, the observed "convolutional gains" 
will become smaller since the block code results will approach 
their respective thresholds. 

VI. Analysis of Derived 
LDPC Convolutional Codes 

This section collects some analytical results about LDPC 
convolutional codes. In particular, we compare the existence / 
non-existence of cycles in LDPC block and LDPC convolu- 
tional codes, we present some properties of pseudo-codewords, 
and we discuss the — mostly moderate — cost increase in 
decoder complexity that is incurred by going from LDPC 
block to LDPC convolutional codes. 

A. Graph-Cycle Analysis 

It is well known that cycles in the Tanner graph repre- 
sentation of a sparse code affect message-passing iterative 
decoding algorithms, with short cycles generally pushing the 
performance further away from optimum. (Indeed, attempts to 
investigate and minimize these effects have been made in ||52l 
and 15311 , where the authors propose LDPC code construction 
procedures to maximize the connectivity of short cycles to the 
rest of the graph, thus also maximizing the independence of 
the messages flowing through a cycle.) Hence it is common 
practice to design codes that do not contain short cycles, so as 
to obtain independent messages in at least the initial iterations 
of the decoding process. 

Avoiding cycles in Tanner graphs also has the benefit of 
avoiding pseudo-codewords To see this, let the active part 
of a pseudo-codeword be defined as the set of bit nodes 
corresponding to the support of the pseudo-codeword, along 

"Here and in the following, pseudo-codewords refer to pseudo-codewords 
as they appear in linear programming (LP) decoding 1541 . 1551 and in the 
graph-cover-based analysis of message-passing iterative decoding in 1221 . 
1231 . For other notions of pseudo-codewords, in particular computation tree 
pseudo-codewords, we refer to the discussion in 1561 . 



TABLE I 

Average (per bit node) number Ni of cycles of length i for the 
Tanner graphs of the block codes (BCs) of block length n and 
convolutional codes (CCs) of constraint length Us discussed 
IN Example[22] (All Tanner graphs have girth 8.) 



Code 


Ns 


^10 


^12 


EC in = 155) 


3.000 


24.000 


146.000 


BC (n = 240) 


2.600 


14.000 


93.400 


EC (n = 400) 


2.200 


12.400 


70.600 


Time-invariant CC (us = 145) 


2.200 


12.400 


70.200 


Time-varying CC (i/g = 155) 


0.910 


8.342 


44.813 


Time-varying CC (u^ = 240) 


0.917 


5.338 


30.242 


Time-varying CC (i/g = 400) 


0.675 


4.705 


24.585 



with the adjacent edges and check nodes. With this, it holds 
that the active part of any pseudo-codeword contains at least 
one cycle and/or at least one bit node of degree one. And 
so, given that the typical Tanner graph under consideration in 
this paper does not contain bit nodes of degree one, the active 
part of a pseudo-codeword must contain at least one cycle. 
Therefore, avoiding cycles implicitly means avoiding pseudo- 
codewords]^ 

Let H and H be two parity-check matrices such that T{H) 
is a graph cover of T{H). It is a well-known result that any 
cycle in T{H) can be mapped into a cycle in T{H). This 
has several consequences. In particular, the girth of T{H) is 
at least as large as the girth of T{H), and more generally, 
T{lk) contains fewer short cycles than T(i?)E3 For the codes 
constructed in this paper, this means that the unwrapping 
process (from block code to convolutional code) can "break" 
some cycles in the Tanner graph of the block code. 

We now revisit some codes that where discussed in earlier 
sections and analyze their graph cycle structure using a brute- 
force search algorithmic Note that, in order to accurately 
compare the graph cycle distributions of two codes with dif- 
ferent block/constraint lengths, we compute the total number 
of cycles of a given cycle length per block/constraint length, 
and divide this number by the block/constraint length|3 

Example 22 Consider the LDPC block and convolutional 
codes that were constructed in Examples \15\ and \18\ and whose 
BER performance was plotted in Figure Table shows the 
average number of cycles of certain lengths for the Tanner 
graphs of the quasi-cyclic block codes, for the Tanner graph 
of the corresponding time- invariant convolutional code, and 

^"Note that the support of any pseudo-codeword is a stopping set 1221 , 1231 . 

^'This observation has been used in many different contexts over the past 
ten years in the construction of LDPC and turbo codes; in particular, it was 
used in 1421 . where the authors dealt with bounding the girth of the resulting 
LDPC convolutional codes. 

-^The search technique that we used is based on evaluating the diagonal 
entries of the powers of the matiix M defined in 1331 Eq. (3.1)]. Note that 
this search technique works only for counting cycles of length smaller than 
twice the girth of the graph. For searching longer cycles, more sophisticated 
algorithms are needed. 

^^For LDPC convolutional codes, we have made use of the periodicity of 
the parity-check matrices in order to complete the search in a finite number 
of steps. 
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TABLE II 

Average (per bit node) number Ni of cycles of length I for the 
Tanner graphs of the block codes (BCs) of block length n and 
convolutional codes (CCs) of constraint length Us discussed 
IN Example[23] (All Tanner graphs have girth 4.) 



Code 


N4 




Rate-1/2 EC (n = 2500) 


0.013 


0.120 


Rate-2/3 EC (n = 2520) 


0.065 


0.839 


Rate-3/4 EC (n = 2520) 


0.136 


2.710 


Rate-4/5 EC (n = 2508) 


0.250 


6.544 


Rate-1/2 time- varying CC (u^ = 2500) 


0.010 


0.064 


Rate-2/3 time- varying CC [u^ = 2520) 


0.044 


0.483 


Rate-3/4 time- varying CC (Ub = 2520) 


0.091 


1.465 


Rate-4/5 time-varying CC (Us = 2508) 


0.173 


3.622 



for the Tanner graph of the time-varying convolutional codes. 

□ 

Example 23 Table |^ shows the cycle analysis results for 
the rate-1/2 proto-graph-based codes that were discussed 
in Example [79] and whose BER performance was plotted in 
Figure ID □ 

From Examples |22] and [23] we see that many of the short 
cycles in the Tanner graphs of the LDPC block codes are 
"broken" to yield cycles of larger length in the Tanner graphs 
of the derived LDPC convolutional codes. 

B. Pseudo-Codeword Analysis 

This section collects some comments concerning the 
pseudo-codewords of the parity-check matrices under consid- 
eration in this paper 

We start by observing that many of the statements that were 
made in f36l about pseudo-codewords can be extended to the 
setup of this paper In particular, if some parity-check matrices 
H and H are such that T(jy') is a graph cover of T{H), 
then a pseudo-codeword of H can be "wrapped" to obtain a 
pseudo-codeword of H, as is formalized in the next lemma. 

Lemma 24 Let the parity-check matrices H and H be such 
that T{H) is an M -fold graph cover ofT{H). More precisely, 
1st H = ^^^^ Hi ® Pi for some set C, for some collection of 
parity-check matrices {Hi\iqc such that H ~ X^^ez: 
Zj, and for some collection of M x M permutation matrices 
{Pi}iec- Moreover, let X be the set of column indices of H 
and let I X M with M = {0, 1, . . . , Af - 1} be the set of 
column indices of H. With this, if uj ^ i'^{i,m)){i,m)eXxM 
a pseudo-codeword of H, then lj ~ {^jJiji^i with 

~ 7^ X! C'« ^) (17) 

is a pseudo-codeword of H. 

Proof: (Sketch.) There are different ways to verify this 
statement. One approach is to show that, based on the fact 
that u! satisfies the inequalities that define the fundamental 
polytope of H 1221 . ||231 . Ii54l . Ii55l . a; satisfies the inequalities 



that define the fundamental polytope of H. (We omit the 
details.) Another approach is to use the fact that pseudo- 
codewords with rational entries are given by suitable projec- 
tions of codewords in graph covers 1221 . 1231 . So, for every 
pseudo-codeword uj of H with rational entries, there is some 
graph cover of T(iJ') with a codeword in it, which, when 
projected down to T(lf), gives u). However, that graph cover 
of 'T{H) is also a graph cover of T(ff), and so this codeword, 
when projected down to T{H), gives uj as defined in ( [TtI i. (We 
omit the details; see l36l for a similar, but less general, result.) 

■ 

One can then proceed as in |[36l and show that the AWGNC, 
the BSC, and the BBC pseudo-weights O, El, |[23l, l54l . 
(SI, l58l of Cj will be at least as large as the corresponding 
pseudo- weights of w. As a corollary, the minimum AWGNC, 
BSC, and BEC pseudo-weights of H are, respectively, at least 
as large as the corresponding minimum pseudo-weights of 
H. Similar results can also be obtained for the minimum 
Hamming distance. 

Because the high-SNR behavior of linear programming 
decoding is dominated by the minimum pseudo-weight of 
the relevant parity-check matrix, the high-SNR behavior of 
linear programming decoding of the code defined by H is at 
least as good as the high-SNR behavior of linear programming 
decoding of the code defined by iJ0 

In general, because of the observations made in Sec- 
tion IVI-AI about the "breaking" of cycles and the fact that 
the active part of a pseudo-codeword must contain at least 
one cycle, it follows that the unwrapping process is beneficial 
for the pseudo-codeword properties of an unwrapped code, 
in the sense that many pseudo-codewords that exist in the 
base code do not map to pseudo-codewords in the unwrapped 
code. It is an intriguing challenge to better understand this 
process and its influence on the low-to-medium SNR behavior 
of linear programming and message-passing iterative decoders, 
in particular, to arrive at a better analytical explanation of the 
significant gains that are visible in the simulation plots that 
were shown in Section [IV] To this end, the results of l46l 
and [|48] with respect to some related code families (see the 
discussion in Section [Vj will be very helpful, since they 
indicate that some of the features of the fundamental polytope 
deserve further analysis. 

C. Cost of the " Convolutional Gain" 

In this subsection, we investigate the cost of the convolu- 
tional gain by comparing several aspects of decoders for LDPC 
block and convolutional codes. In particular, we consider 
the computational complexity, hardware complexity, decoder 
memory requirements, and decoding delay. More details on the 
various comparisons described in this section can be found in 

Eol, iol, H]. 

LDPC block code decoders and LDPC convolutional code 
decoders have the same computational complexity per decoded 
bit and per iteration since LDPC convolutional codes derived 
from LDPC block codes have the same node degrees (row and 

^''We neglect here the influence of the multiplicity of the minimum pseudo- 
weight pseudo-codewords. 
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column weights) in their Tanner graph representations, which 
determines the number of computations required for message- 
passing decoding. 

We adopt the notion of processor size to characterize the 
hardware complexity of implementing the decoder. A de- 
coder's processor size is proportional to the maximum number 
of variable nodes that can participate in a common check 
equation. This is the block length n for a block code, since 
any two variable nodes in a block can participate in the same 
check equation. For a convolutional code, this is the constraint 
length i^s, since no two variable nodes that are more than 
positions apart can participate in the same check equation. The 
constraint lengths of the LDPC convolutional codes derived 
from LDPC block codes of length n satisfy ^ n. Therefore, 
the convolutional codes have a processor size less than or equal 
to that of the underlying block code. 

On the other hand, the fully parallel pipeline decoding 
architecture penalizes LDPC convolutional codes in terms of 
decoder memory requirements (and decoding delay/latency) as 
a result of the / iterations being multiplexed in space rather 
than in time. The pipeline decoder architecture of Figure [T| 
consists of / identical processors of size performing / 
decoding iterations simultaneously on independent sections 
of a decoding window containing / constraint lengths of 
received symbols. This requires / times more decoder memory 
elements than an LDPC block code decoder that employs 
a single processor of size n ^ Vs performing / decoding 
iterations successively on the same block of received symbols. 
Therefore, the decoder memory requirements and the decoding 
delay of the pipeline decoder are proportional to Vg ■ I, whereas 
the block decoder's memory and delay requirements are only 
proportional to n. Another way of comparing the two types of 
codes, preferred by some researchers, is to equate the block 
length of a block code to the memory/delay requirements, 
rather than the processor size, of a convolutional code, i.e., 
to set 11 = Vs ■ I. In this case the block code, now having a 
block length many times larger than the constraint length of the 
convolutional code, will typically (depending on /) outperform 
the convolutional code, but at a cost of a much larger hardware 
processor. Finally, as noted in Section [III the parallel pipeline 
decoding architecture for LDPC convolutional codes can be 
replaced by a serial looping decoding architecture, resulting 
in fewer processors but a reduced throughput along with the 
same memory and delay requirements. 

In summary, the convolutional gain achieved by LDPC 
convolutional codes derived from LDPC block codes comes 
at the expense of increased decoder memory requirements and 
decoding delays. Although this does not cause problems for 
some applications that are not delay-sensitive (e.g., deep-space 
communication), for other applications that are delay-sensitive 
(e.g., real-time voice/video transmission), design specifications 
may be met by deriving LDPC convolutional codes from 
shorter LDPC block codes, thus sacrificing some coding gain, 
but reducing memory and delay requirements, or by employing 
a reduced window size decoder, as suggested in the recent 
paper by Papaleo et al. ll28l . with a resulting reduction in the 
"convolutional gain." 
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VII. Conclusions 

In this paper we showed that it is possible to connect 
two known techniques for deriving LDPC convolutional codes 
from LDPC block codes, namely the techniques due to Tanner 
and due to Jimenez-Feltstrom and Zigangirov. This connection 
was explained with the help of graph covers, which were also 
used as a tool to present a general approach for constructing 
interesting classes of LDPC convolutional codes. Because it is 
important to understand how the presented code construction 
methods can be used — and in particular combined — we then 
discussed a variety of LDPC convolutional code constructions, 
along with their simulated performance results. 

In the future, it will be worthwhile to extend the presented 
analytical results, in particular to obtain a better quantitative 
understanding of the low-to-medium SNR behavior of LDPC 
convolutional codes. In that respect, the insights in the papers 
by Lentmaier et al. 1461 . BtI and Kudekar et al. ||48| on the 
behavior of related code families will be valuable guidelines 
for further investigation. 
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